1,797 research outputs found

    Quantifying appearance retention in carpets using geometrical local binary patterns

    Get PDF
    Quality assessment in carpet manufacturing is performed by humans who evaluate the appearance retention (AR) grade on carpet samples. To quantify the AR grades objectively, different research based on computer vision have been developed. Among them Local Binary Pattern (LBP) and its variations has shown promising results. Nevertheless, the requirements of quality assessment on a wide range of carpets have not been met yet. One of the difficulties is to distinguish between consecutive AR grades in carpets. For this, we adopt an extension of LBP called Geometrical Local Binary Patterns (GLBP) that we recently proposed. The basis of GLBP is to evaluate the grey scale differences between adjacent points defined on a path in a neighbourhood. Symmetries of the paths in the GLBPs are evaluated. The proposed technique is compared with an invariant rotational mirror based LBP technique. The results show that the GLBP technique performs better to distinguish consecutive AR grades in carpets

    Kestävät jalat - kettujen jalkaterveyden kehityshanke. Raportti nro 3

    Get PDF
    Luken kirjat, raportit, oppaat ja esitteet. Raportti nro 3201

    Expanding the Family of Grassmannian Kernels: An Embedding Perspective

    Full text link
    Modeling videos and image-sets as linear subspaces has proven beneficial for many visual recognition tasks. However, it also incurs challenges arising from the fact that linear subspaces do not obey Euclidean geometry, but lie on a special type of Riemannian manifolds known as Grassmannian. To leverage the techniques developed for Euclidean spaces (e.g, support vector machines) with subspaces, several recent studies have proposed to embed the Grassmannian into a Hilbert space by making use of a positive definite kernel. Unfortunately, only two Grassmannian kernels are known, none of which -as we will show- is universal, which limits their ability to approximate a target function arbitrarily well. Here, we introduce several positive definite Grassmannian kernels, including universal ones, and demonstrate their superiority over previously-known kernels in various tasks, such as classification, clustering, sparse coding and hashing

    Face Detection with Effective Feature Extraction

    Full text link
    There is an abundant literature on face detection due to its important role in many vision applications. Since Viola and Jones proposed the first real-time AdaBoost based face detector, Haar-like features have been adopted as the method of choice for frontal face detection. In this work, we show that simple features other than Haar-like features can also be applied for training an effective face detector. Since, single feature is not discriminative enough to separate faces from difficult non-faces, we further improve the generalization performance of our simple features by introducing feature co-occurrences. We demonstrate that our proposed features yield a performance improvement compared to Haar-like features. In addition, our findings indicate that features play a crucial role in the ability of the system to generalize.Comment: 7 pages. Conference version published in Asian Conf. Comp. Vision 201

    LBP and irregular graph pyramids

    Get PDF
    In this paper, a new codification of Local Binary Patterns (LBP) is given using graph pyramids. The LBP code characterizes the topological category (local max, min, slope, saddle) of the gray level landscape around the center region. Given a 2D grayscale image I, our goal is to obtain a simplified image which can be seen as “minimal” representation in terms of topological characterization of I. For this, a method is developed based on merging regions and Minimum Contrast Algorithm

    The Conditional Lucas & Kanade Algorithm

    Full text link
    The Lucas & Kanade (LK) algorithm is the method of choice for efficient dense image and object alignment. The approach is efficient as it attempts to model the connection between appearance and geometric displacement through a linear relationship that assumes independence across pixel coordinates. A drawback of the approach, however, is its generative nature. Specifically, its performance is tightly coupled with how well the linear model can synthesize appearance from geometric displacement, even though the alignment task itself is associated with the inverse problem. In this paper, we present a new approach, referred to as the Conditional LK algorithm, which: (i) directly learns linear models that predict geometric displacement as a function of appearance, and (ii) employs a novel strategy for ensuring that the generative pixel independence assumption can still be taken advantage of. We demonstrate that our approach exhibits superior performance to classical generative forms of the LK algorithm. Furthermore, we demonstrate its comparable performance to state-of-the-art methods such as the Supervised Descent Method with substantially less training examples, as well as the unique ability to "swap" geometric warp functions without having to retrain from scratch. Finally, from a theoretical perspective, our approach hints at possible redundancies that exist in current state-of-the-art methods for alignment that could be leveraged in vision systems of the future.Comment: 17 pages, 11 figure

    MinMax Radon Barcodes for Medical Image Retrieval

    Full text link
    Content-based medical image retrieval can support diagnostic decisions by clinical experts. Examining similar images may provide clues to the expert to remove uncertainties in his/her final diagnosis. Beyond conventional feature descriptors, binary features in different ways have been recently proposed to encode the image content. A recent proposal is "Radon barcodes" that employ binarized Radon projections to tag/annotate medical images with content-based binary vectors, called barcodes. In this paper, MinMax Radon barcodes are introduced which are superior to "local thresholding" scheme suggested in the literature. Using IRMA dataset with 14,410 x-ray images from 193 different classes, the advantage of using MinMax Radon barcodes over \emph{thresholded} Radon barcodes are demonstrated. The retrieval error for direct search drops by more than 15\%. As well, SURF, as a well-established non-binary approach, and BRISK, as a recent binary method are examined to compare their results with MinMax Radon barcodes when retrieving images from IRMA dataset. The results demonstrate that MinMax Radon barcodes are faster and more accurate when applied on IRMA images.Comment: To appear in proceedings of the 12th International Symposium on Visual Computing, December 12-14, 2016, Las Vegas, Nevada, US

    A good balance of costs and benefits: convincing a university administration to support the installation of an interactive multi-application display system on campus

    Get PDF
    Interactive digital signage systems allow passers-by to take (temporary) control of a public display in order to select content and applications of interest, or even upload content of their own. Not surprisingly, display owners are hesitant to embrace such interactivity, given the uncertainty of what will be shown on their displays. In this paper we summarize our experience of deploying an interactive multi-application display system in the context of a university environment, and in particular our engagements with display owners (i.e., university administration) in order to convince them and get their support for the installation and deployment of such a system. We present the results of semi-structured interviews with display owners regarding their motivations, needs, and concerns with respect to the deployment of such a system at our university. While one cannot generalize from our results, we nevertheless believe that our experiences offer helpful advice to developers of such systems (and/or researchers interested in designing and studying them) in order to aid them in successfully gathering the support of these important stakeholders
    corecore